Segmentation of Overlapped Handwritten Arabic Sub-Words
نویسندگان
چکیده
Arabic script is cursive in both handwritten and printed form. Segmentation of Arabic scriptespecially handwrittenis a very challenging task. Many difficulties arise due to the inherent characteristics of Arabic writing such as the overlapping of Arabic sub-words wherein the sub-words share the same vertical space, and vertical ligatures wherein characters are stacked upon each other in a word. In this paper, an algorithm to resolve the overlapping of handwritten Arabic sub-words is introduced. The proposed algorithm is based on pushing strategy;
منابع مشابه
Automatic Segmentation for Arabic Character Handwriting
The cursive and ligature nature of the Arabic language make the segmentation of words into individual characters a difficult task. Despite attempts to apply methods for cursive Latin and other languages to Arabic, it is generally insufficient to segment Arabic text. This paper proposes a new segmentation algorithm for handwritten Arabic text and the main idea consist of segmenting the word into...
متن کاملArabic Handwritten: Pre-Processing and segmentation
This paper is concerned with pre-processing and segmentation tasks that influence the performance of Optical Character Recognition (OCR) systems and handwritten/printed text recognition. In Arabic, these tasks are adversely effected by the fact that many words are made up of sub-words, with many sub-words there associated one or more diacritics that are not connected to the sub-word’s body; the...
متن کاملComponent-based Segmentation of Words from Handwritten Arabic Text
Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words se...
متن کاملSegmenting Arabic Handwritten Documents into Text lines and Words
In this paper, we present a method for segmenting Arabic handwritten documents into text lines and words. Text line segmentation is addressed by a well-known technique, the horizontal projection profile, in which autocorrelation is used to enhance the self similarity of this profile. This technique promotes the estimation of text line spacing. Word extraction is based on an adaptation of a know...
متن کاملA New Arabic (ahd/amsh) Handwritten Database
This paper introduces new database for Arabic handwritten words. The Arabic handwritten database (AHD/AMSH) represents a utility to facilitate the experiments of the character recognition algorithms. It contains three types of images: word, isolated character, and digit images. The AHD/AMSH can be used for baseline detection, characters segmentation, normalization, thinning, training and testin...
متن کامل